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Abstract — We establish a lower bound on the noncoherent ca- 
pacity pre-log of a temporally correlated Rayleigh block-fading 
single-input multiple-output (SIMO) channel. Surprisingly, when 
the covariance matrix of the channel satisfies a certain technical 
condition related to the cardinality of its smallest set of linearly 
dependent rows, this lower bound reveals that the capacity pre- 
log in the SIMO case is larger than that in the single-input single- 
output (SISO) case. 

1. Introduction 

It is well known that the coherent-capacity pre-log (i.e., the 
asymptotic ratio between capacity and the logarithm of SNR, as 
SNR goes to infinity) of a single-input multiple-output (SIMO) 
fading channel is equal to 1 and is, hence, the same as that 
of a single-input single-output (SISO) fading channel [li|. In 
the practically more relevant noncoherent setting, where neither 
transmitter nor receiver have channel-state information, but both 
are aware of the channel statistics, the effect of multiple antennas 
on the capacit}[^pre-log is understood only for a specific simple 
channel model, namely, the constant block-fading model. In this 
model, the channel is assumed to remain constant over a block 
of T symbols and to change in an independent fashion from 
block to block [2|. For this model, the SIMO capacity pre-log is 
again equal to the SISO capacity pre-log, but, differently from 
the coherent case, is given by 1 — 1/T |'3l, 1*4^. 

A more general way of capturing channel variations in time is 
to assume that the fading process is stationary. In this case, the 
capacity pre-log is known only in the SISO |5 1 and the MISO |(6l 
Thm. 4.15] cases. The capacity bounds for the SIMO stationary- 
fading channel available in the literature |6, Thm. 4.13] do not 
allow one to determine whether the capacity pre-log in the SIMO 
case can be larger than that in the SISO case. 

In this paper, we focus on a channel model that can be seen as 
lying in between the general stationary-fading model considered 
in pi, 161, and the simpler constant block-fading model analyzed 
in Pi, p^. Specifically, we assume that the fading process is 
independent across blocks of length T and temporally correlated 
within blocks, with the rank of the corresponding T xT channel 
covariance matrix given bjj^ Q < T. For this channel model, 
referred to as the correlated block-fading model in the following, 
the SISO capacity pre-log is equal to 1 - Q/T |8||^The SIMO 
and MIMO capacity pre-logs are not known in this case. A 

'in the remainder of the paper, we consider the noncoherent setting only. 
Consequently, we will refer to capacity in the noncoherent setting simply as 
capacity. Furthermore, we shall assume Rayleigh fading throughout. 

^When Q = T, capacity is known to grow double-logarithmically in SNR [V), 
and, hence, the capacity pre-log is zero. 

^The constant block-fading model is obviously a special case ((3 = 1) of the 
correlated block-fading model. 



conjecture in |[8] on the MIMO capacity pre-log implies that the 
capacity pre-log in the SIMO case would be the same as that in 
the SISO case. In this paper, we disprove the conjecture in |8| by 
showing that in the SIMO case a capacity pre-log of 1 — 1/T can 
be obtained when the number of receive antennas is equal to Q, 
and the channel covariance matrix satisfies a certain technical 
condition detailed in Theorem [T] 

Notation: Uppercase boldface letters denote matrices, and 
lowercase boldface letters designate vectors. The all-zero matrix 
of appropriate size is written as 0. The element in the ith row 
and jth column of a matrix A is denoted as aij, and the ith com- 
ponent of the vector u is Ui. For a vector u, diag(u) denotes the 
diagonal matrix that has the entries of u on its main diagonal. The 
superscripts ^ and ^ stand for transposition and Hermitian trans- 
position, respectively. The expectation operator is denoted as E [•] . 
For two matrices A and B, we designate the Kronecker product 
as A (g) B; to simplify notation, we use the convention that the 
ordinary matrix product always precedes the Kronecker product, 
i.e., AB (g) C = (AB) (g) C. For two functions f{x) and g{x), 
the notation /(x) — C9((7(a;)) means that lima; _j.oo 
is bounded above by a constant. We use [n : m] to designate the 
set of natural numbers {n, n + 1, . . . , m}. Let g(u) be a vector- 
valued function; then dg/du denotes the Jacobian matrix of the 
function g(u), i.e., the matrix that contains the partial derivative 
dgi /duj in its ith row and jth column. We write \I\ to denote the 
cardinality of the set I. For an AI x N matrix A, and two sets of 
indices I C [1 : M] and C [1 : A^], we use [A]^; j to denote the 
\I\ X \ J'\ submatrix of A containing the elements [0'i,j]i^x jej- 
Similarly, for an -dimensional vector u and a set I C [1 : A^], 
we define Uj ^ [ui]i,=x- For an AI x N matrix A, we set 



= [A][i:M],j and [A]^^^ = [A; 



Furthermore, 



D{A) 



diag([aii . . . aMiV) 



diag([aijv • • ■ aMNV), 



(1) 



The eigenvalues of the x TV matrix A are denoted by Ai (A) > 
• • • > Ajv(A). The logarithm to the base 2 is written as log(-). 
Finally, C7V(m, C) stands for the distribution of a jointly proper 
Gaussian (JPG) random vector with mean m and covariance 
matrix C. 

II. System Model 

We consider a SIMO channel with Q receive antennas. The 
fading in each component channel follows the correlated block- 
fading model described in the previous section, namely, it is 
independent across blocks of length T, and correlated within 



blocks, with the rank of the corresponding channel covariance 
matrix given by Q < T. Note that we assume the rank of the 
channel covariance matrix to be equal to the number of receive 
antennas. Our analysis relies heavily on this assumption. Across 
component channels, the fading is independent and identically 
distributed. The input-output (I/O) relation (within any block) 
for the TOth component channel can be written as 

Ym = 4p diag(h„i)x + w„i, me [1 : Q] 

where the vector x — \x\ - ■ ■ xt^ G contains the T- 
dimensional signal transmitted within the block, and the vectors 
Yrm Wm € C"^ Contain the corresponding received signal and ad- 
ditive noise, respectively, at the mth antenna. Finally, h„j e 
contains the channel coefficients between the transmit antenna 
and the mth receive antenna. We assume that w,„ ~ CA/'(0, It) 
and hm ~ CA/^(0, PP"^) are mutually independent (and inde- 
pendent across m) and that P e C^^'^ (which is the same for 
all blocks) has rank Q < T. It will turn out convenient to write 
the channel-coefficient vector in whitened form as = Ps™, 
where s™ ^ C7V^(0, Ig). Finally, we assume that s„j and w,„ 
change in an independent fashion from block to block. 

If we define = [yj • • • yg], = [^7 ■ • • Sg], ^ 
■ ■ ■ Wg], and X = diag(x), we can write the channel I/O 
relation in the following — more compact — form 

y = Vp(Iq<»XP)s + w. (2) 

The capacity of the channel (p} is defined as 



C(p)^(l/r) sup/(x;y) 
/=c(-) 



(3) 



where the supremum is taken over all input distributions /x( ) 
that satisfy the average-power constraint E[||x|p] < T. 

III. Intuitive Analysis 

In this section, we describe a "back-of-the-envelope" method 
for guessing the capacity pre-log. A formal justification of this 



procedure is provided in Section IV 

The capacity pre-log characterizes the asymptotic behavior 
of the fading-channel capacity at high SNR, i.e., in the regime 
where the additive noise can "effectively" be ignored. In order 
to guess the capacity pre-log, we therefore consider the problem 
of identifying the transmit symbols Xi, i G [1:?"], from the 
noise-free (and rescaled) observation 



y^(Ig$5XP)s. 



(4) 



Specifically, we shall ask the question: "How many symbols 
Xi can be identified uniquely from y given that the channel 
coefficients s are unknown but the statistics of the channel, i.e., 
the matrix P, are known?" The claim we make is that the capacity 
pre-log is given by the number of these symbols divided by the 
block length T. 

We start by noting that the unknown variables in Q are s and 
X, which means that we have a quadratic system of equations. 
It turns out, however, that the simple change of variables Zi ^ 
l/xi, i € [1 : T], (we make the technical assumption < \xi\ < 
oo, i G [1 : T], in the remainder of this section) transforms (|4]) 
into a system of equations that is linear in s and z;, i e [1 : T]. 
Since the transformation Zi = l/xi is invertible for < Xi < oo. 



uniqueness of the solution of the linear system of equations in 
s and Zi, i e [1 : T], is equivalent to uniqueness of the solution 
of the quadratic system of equations in s and Xj, i e 
For simplicity of exposition and concreteness, we consider the 
special case T = 3 and Q = 2. A direct computation reveals 
that Q is equivalent to 
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The solution of this linear system of equations is not unique, as 
we have 6 equations in 7 unknowns. The Xi = 1/ Zi, i G [1:3], 
can, therefore, not be determined uniquely from y. However, if 
we transmit one pilot symbol and two data symbols, the system 
of equations becomes solvable. Take for example xi = 1 and 
let the receiver know the value of this (pilot) symbol. Then (|5]l 
reduces to the following inhomogeneous system of 6 equations 
in 6 unknowns 
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This system of equations has a unique solution if det B 7^ 0. 
We prove in Appendix |A] that under the technical condition 
on P specified in Theorem [T| below, we, indeed, have that 
det B ^ for almost alQy2 ,y3,y5,ye- It, therefore, follows that 
for almost all y, the system of equations ^ has a unique solution. 
Consequently, we can recover Z2 and Z3, and, hence, X2 = l/z2 
and X3 = I/Z3. Summarizing our findings, we expect that the 
capacity pre-log of the channel (|2]|, for the special case T — 3 
and Q = 2, is equal to 2/3. This is larger than the capacity 
pre-log of the corresponding SISO channel (i.e., the capacity 
pre-log of one of the component channels), which is equal to 
l-Q/T= 1/3 §. 

In general, we expect that under some technical conditions 
on P the capacity pre-log of the SIMO channel as defined in 
Section [n] is equal to (T - l)/r = 1 - 1/T. This is exactly 
what we intend to show rigorously in the next section. 

IV. A Lower Bound on the Capacity Pre-Log 

The main result of this paper is the following theorem. 

Theorem 1: Assume that there exists a subset of indices I C 
[1:T] of cardinahty \I\ = Q + 1 such that the [{Q + 1) x 
Q] -dimensional submatrix P = [P]io °f ^he matrix P in (|2| 
satisfies the following Property (A): Any set of Q rows of P is 
linearly independent. Then, the capacity of the SIMO channel (|2| 
can be lower-bounded as 



Cip) > (1 - l/T) log(p) + 0{l), p ^ 00. 



Except for a set of measure zero. 



(7) 



Remark 1: For the special case T = Q + 1, (|7]) yields a lower 
bound on the capacity pre-log that is tight. A matching upper 
bound can be obtained through steps similar to those in the proof 
of |[8] Prop. 4]. Establishing tight upper bounds on the capacity 
pre-log for general values of T, however, seems to be an open 
problem. Different tools than those used in |8| are probably 
needed. 

Remark 2: When Q = 1, the channel in (|2]) reduces to a SISO 
constant block-fading channel, and the lower bound (j7| yields 
the correct capacity pre-log of Q, Q. 

Remark 3: Property (A) is not very restrictive and is satisfied 
by many practically relevant matrices P. For example, removing 
any set of T— Q columns from aTxT discrete Fourier transform 
(DFT) matrix, results in a matrix that satisfies Property (A) when 
T is prime ||9|. DFT covariance matrices occur naturally in basis- 
expansion models for time-varying channels |8| . 

Proof: We choose an input distribution for which the en- 
tries Xi, i E [l:r], of X, are independent and identically 
distributed (i.i.d.), have zero mean and unit variance, and sat- 
isfy E[log|a;i|] > — oo and h{xi) > — oo. For example, we 
can take Xi ~ CAf{0, 1). We then lower-bound /(x; y) in ([3]l, 
evaluated for this input distribution. More precisely, we use 
/(x; y) ~ h{y) — h{y \ x) and bound the two differential entropy 
terms separately. Note that the class of input distributions for 
which (|7| holds is large. This does not come as a surprise, as we 
are interested in the capacity pre-log only. 

As y conditional on x is JPG, the conditional differential 
entropy h{y\x.) can be upper-bounded in a straightforward 
fashion as follows: 

/i(y|x) = Qriog(7re) 

+ E,, [log det (Iqt + P (Iq ® XP) [ss"] (Iq P"X") )] 

< Q^ogip) + 0{1) (8) 

where the inequality holds because P has rank Q. 

Finding a tight lower bound on h{y) is the main difficulty of 
the proof In fact, the differential entropy of y is often intractable 
even for simple input distributions. The main technical contribu- 
tion of this paper is presented in Section [rV-A| below, where we 
show that if Property (A) is satisfied and if the input distribution 
satisfies the conditions specified at the beginning of this proof, 
we have 

hiy)>{T-l + Q^)logip) + c (9) 

where c, here and in the remainder of the paper, stands for a 
constanj^that is independent of p. Combining (|8]l and (|9k then 
yields the desired result. Note that in order to establish ^ it is 
sufficient to use that P has rank Q, whereas the more restrictive 
Property (A) is crucial to establish (|9]l. ■ 

A. A Lower Bound on h{y) 

The main idea of our approach is to relate h{y) to h{s, x) = 
h(s) + h{x), which is generally much simpler to compute 
than h{y). It is possible to relate the entropies of two random 
vectors in a simple way if the vectors are of the same dimension 
and are connected by a deterministic one-to-one (in the sense 
of 1 10 p. 7]) function. This is not the case for y and [s^ x^]^. It 



is, however, possible to show that if xi is a fixed parameter, then 
there is a deterministic one-to-one function between [s^ x ^ . -^j ] ^ 
and a specific subset d [1: TQ] of components of the noiseless 
version y of the output vector This allows us to relate h{y j \xi ) 
to ft,(s,X[2:T] l^i) = h{s,X[2:T])^ wMch will tum out to be 
sufficient for our purposes as h{y) can be linked to h{yj \ xi) 
according to ( [T2j i below. We now describe the details of the proof 
program outlined above. 

Lemma 2: Assume that the matrix P satisfies the conditions 
of Theorem [Hand take the submatrix P defined in Theorem [l] 
to consist of the first Q + 1 rows of P for simplicityJ^Let 



[l:r]U[T + l:T + Q + l] 

u [2r + 1 : 2r + Q + 1] u • • • 

u[(g-i)r + i:(Q-i)r + Q + i] 



(10) 



where IJ^I = T- 
yj : CT-l+Q^ 



\+Q-, and consider the vector-valued function 



yj(s,X[2:T]) = ((Iq®XP)s) 



J 



(11) 



parametrized by xi ^ 0. To simplify the notation we will not 
indicate this parametrization explicitly. The function yj{ ) is 
one-to-one almost everywhere (a.e.) on C"'"^^^'^ . 

Proof See Appendix [A] ■ 
The following comments on Lemma |2] are in order. For T — 
3 and Q = 2 as in the simple example in Section III J = 
[1:6], so that yj = y. Therefore, the one-to-one correspondence 
established in this lemma simply means that Q has a unique 
solution for fixed xi 0. For the proof of the lemma, it is 
crucial that zi 7^ is fixed. In fact, one can check that if none 
of the components of x is fixed, the resulting equivalent of the 
function yj{-) cannot be one-to-one, no matter how the set J' 
is chosen.Fixing xi 7^ in order to make the function yj( ) be 
one-to-one corresponds to transmitting a pilot, as done in the 
simple example in Section|in]by setting xi — 1. The cardinality 
of the set J', which determines the lower bound on the capacity 
pre-log, as we shall see below, is dictated by the requirement that 
y J and [s^ x^ . ^j] are of the same dimension, which implies 
that J7 must contain T—l + Q^ elements. The specific choice of 

in ( [TO| l simplifies the proof of the lemma (see Appendix [B|. 

Lemma |2] can be used to relate the conditional differential 
entropy h{yj \xi)to h{s, X[2:t])- Before doing so, we establish 
a simple lower bound on h{y) that is explicit in h{y j\xi). Let 
Af be the complement of J' in [1 : QT]. Then 



Hy) = h{yj,y^f) 

> \J\ log(p) 



= Hyj) + h{yM\yj) 

wj) + /i(yAA I s,x,yj) 

I wj) + /i(w7v) 
'rh{yj\xi)+c. 



(12) 



The value of this constant can change at each appearance. 



Through this chain of inequalities, we got rid of the noise w. 
This corresponds to considering the noise-free I/O relation Q 
in the intuitive explanation given in Section III Inserting |i7| = 
T — 1 + into ( [T2] l, we obtain the desired result (|9| provided 

^This assumption will be made in the remainder of the paper, without explicitly 
mentioning it again. 



that h{y j\xi) is finite, which will be proved by means of the 
following lemma. 

Lemma 3 (Transformation of differential entropy): Assume 
that g:C^— >C^isa continuous vector- valued function that 
is one-to-one a.e. on C^. Let u G be a random vector and 
V — g(u). Then 

hiy) = h{n) + E„[log |det(ag/9u)|] . 

Proof: The proof follows from the change-of- variable theo- 



rem for integrals 1 10 Thm. 7.26]. 

Let /a; J (x) denote the density of xi. Then 

Kfj I xi) = If XI {x)h{yj I xi = x)dx 



^(S,X[2:T] I Xi) 



log 



det 



9(S,X[2:T]) 



(13) 



where in the second equality we applied Lemma[3]to h{yj\xi — 
x), using that the function yj{ ) in ( [TT| ) is continuous and is 
one-to-one a.e. as shown in Lemma|2] The first term on the RHS 
of ([T3| satisfies 

/l(s,X[2:T] \xi)= h{s) + h{X[2:T]) > -OO 

where the inequality follows because the Xi,i G [2 : T], are i.i.d. 
and have finite differential entropy. It therefore remains to show 
that the second term on the RHS of ( pj) is finite as well. As the 
RHS of ([TTJ is linear in s, we have that 

dyj/ds = [lQ®XP]^^^. 

Furthermore, using fTTj Eq. (5), Sec. 7.2], the RHS of ([TT) can 
be rewritten as y J (s, X[2:T]) = ((S (g) lT)-D(P)x)jr, where the 
operator £>(•) was defined in ([T]i and 

= [si...sq]. (14) 

Hence, we have that 

dyj/dx[2:T] = [(S®It)I?(P)]jJ2:T]- 

To summarize, the Jacobian matrix in ( [T3| ) is given by 



5(S,X[2:T]) 



[(S®lT)i^(P)] 



(15) 

As shown in Appendix|B] the determinant of this Jacobian matrix 
can be factorized as follows 

|det [[Iq ^ XP]_y_^ [(S lT)DiP)]j ^2:T]] I 

|dGtMi(X)| |dctM2(S)| 



n 

je[Q+2:T] 



where 



X |detM3(P)| |detM4(S)| |detM5(X)| (16) 
M2(S)^S®Iq+i 



M3(P) ^ 



(Iq®p) [d{P) 



<>,[2:Q+1] 



M4(S) ^ 
M5(X) ^ 



(S-i®Iq) 0^ 
Iq 

I 







M[2:Q+1],[2:Q+1]. 

Hence, we can rewrite the second term on the RHS of ( [T3] l as 

dyj 



E, 



log 



det 



5(S,X[2:T]) 



ie[0+2:T] 



log 



9=1 



Ex[log|det]V[i(X) 



■Es[log|detM2(S) 
-Es[log|det M4(S) 



+ log|detM3(P)| 
+ Ex[log|detM5(X) 



(17) 



The first and the third term on the RHS of ( [T7] i are finite because 
S has i.i.d. Gaussian components. The fifth term is finite for the 
samereason, becauselog |det S~^| = — log | det S|. The second 
and the sixth term are finite because E[log \xi\] > —oo,i e 
[1 : T], by assumption. Finally, we show in Appendix [c| that the 
matrix M3(P) has full rank if Property (A) is satisfied. This then 
implies that the fourth term on the RHS of ( [T7] i is also finite. 

V. Conclusion and Further Work 

In this paper, we analyzed the noncoherent-capacity pre-log of 
a temporally correlated block-fading channel. We showed that, 
surprisingly, the capacity pre-log in the SIMO case can be larger 
than that in the SISO case. This result was established for the 
special case of the number of receive antennas being equal to the 
rank of the channel covariance matrix. Interesting open issues 
include extending the lower bound in Theorem [T] to an arbitrary 
number of receive antennas and finding a tight upper bound on 
the capacity pre-log. 

Appendix A 

Proof of Lemma |2j We need to show that the function 
yj'(s,X[2:T]) is one-to-one a.e. Hence, we can exclude sets of 
measure zero from its domain. In particular, we shall consider 
the restriction of the function yj(s, X[2:t]) to the set of pairs 
(s,X[2:T])> which satisfy (i) < \xi\ < oo for all i e [2:T]; 
(ii) the matrix S, defined in ([T4| is invertible; (ill) the sum 
J2q=i ^iqPjq 1^ uouzcro for all j e [Q + 2:T]. 

To show that this restriction of the function yj{') [which, 
with slight abuse of notation we still call yj{-)] is one-to-one, 
we take an element y from its range and prove that the equation 



yj(s',X[2:Ti) = y 



(18) 



has a unique solution in the set of pairs (s', x'^2-t] ') satisfying 
the constraints (p)-(|iii|i. The element y can be represented as 
y = ((Iq (X) XP) s) with X — diag( [xi xj^.^,,]^), where 



[2:T] 

(s, X[2:T]) satisfies the constraints Hence, ([T8]l can be 

rewritten in the following way 



((Iq®: 



(19) 



)X'P)s')j = ((Iq®XP)s)j 

with X' = diag^[xi {'x'^^-t])'^^^- To prove that ^T9\ has a 
unique solution, we follow the approach described in Section III 
and convert ([T9|l into a linear system of equations through 



a change of variables. In particular, thanks to constraint 
we can multiply both sides of ([T9| by [Ig and by 

[Iq ® 'KYj^j and perform the substitution z[ — l/x[, i £ [2 : T] . 
Then, we define z' — [z'2 . . . z'rp]^ and manipulate the equation 
such that all the unknowns are on one side of the equation and 
all the terms depending on the constant x[ are on the other side. 
These steps yield the following inhomogeneous linear system 
of equations 



G 



s 

-z' 



1 



(20) 



where 



G 



T] 



(21) 



= [p|si 0_-_0 p7s2 0_-_0 ••• pJsQ 0_--_0]^ 



T-l times 



Q times 



and pj denotes the first row of P. The solution of ( |20] i is unique 
if and only if det G 7^ 0. To establish this, it is useful to note that 
the matrix G has a structure similar to the matrix on the RHS 
of ( [T5] l [the only difference is that X in ( [T5] l is replaced by X^^ 
in G]. Therefore, we can factorize det G in exactly the same 
way as the RHS of ( [T5| ). Finally, we invoke the constraints ([n]), 
and (|iii| together with Property (A) to conclude that each term in 
the resulting factorization is nonzero. This completes the proof. 

We point out that for T = 3 and Q = 2, the matrix B defined 
in ^ is related to G in ( |2T] i according to B = (I2 (E) X)G, and 
therefore det B 7^ a.e., as claimed in Section [Till 

Appendix B 

Proof of ^T6\ : As a consequence of the choice of in ( fTO] ), 
each of the last T — Q — 1 columns of the matrix on the RHS 
of ^T5\ has exactly one nonzero element. This allows us to use the 
Laplace formula to expand the determinant along these columns 
iteratively to get 

|det [[Iq ® XP]_y^^ [(S (g> It)D{P)]j^^,.^^^] | 



n 

je[Q+2:T] 



E 



SlqPjq 



I det (E) I (22) 



where 



E=[Iq®XP (S®Iq+i)D] 



D 



DiP) 



o.[2:Q+l] 



^ - W[1:Q+1],[1:Q+1] 



Next, using simple properties of the Kronecker product and 
exploiting the block-diagonal structure of D, we factorize E 
into a product of simple terms: 



E= Iq®X (S®Iq+i 



(S-1(8)Iq) 0' 
Iq 







D 


-1 

[2:Q+1],[2:Q + 1] 



(23) 



The proof is completed by inserting ( [23| l into p2| ), and using the 
multiplicativity of the determinant. 



Appendix C 

Lemma 4: Let A be an [N + 1) x matrix. If any set of N 
rows of A is linearly independent, then the N{N-\-l) x N{N+1) 
matrix A defined as A — [(Iat A) [£'(A)]^ w+i]] has 
full rank. 

Proof: The proof is by contradiction. Assume that A 
does not have full rank. Then there exists an N{N + 1)- 
dimensional nonzero vector = [u]^ 
u„ e C^+\ such that k = 
[(Ijv ® A 



u , where 



Because A — 
-D(A)]^ j2:Ar+i]], we havc in particular that 



(i) vJ{In®A) = and (ii) [i?(A)]^ ^^^^^j = 0. We 
next analyze these two equalities separately. Equality (i) can 
be restated as uJ^A = for all n e [1:^], which implies 
that all vectors u„ lie in the kernel of the N x (TV + 1) matrix 
A^. Because A^ has rank N, its kernel must be of dimension 
1. Hence, all vectors u„ must be coUinear, i.e., there exists a 
vector V and a set of N constants c„ such that u„ = c„v, for 
all 71 e [1 : The vector v and at least one of the constants c„ 
must be nonzero because u is nonzero. Furthermore, because 
v^A = 0, and because any set of N rows of A is linearly inde- 
pendent by assumption, all components of v must be nonzero. 

We now use this property of v to analyze equality (ii), which 
can be restated as 



or, after straightforward manipulations, as 



= 



[diag(v)][2: 



[A]. 



0. 



: Af+l],[2:Af+l] l^J [2 : JV+1] ,o Fi • • • CjvJ 

Because all the components of v are nonzero, this last equal- 
ity implies that [A]j2.^_|_i] ^[cf ■ ■ cat]^ — 0. However, this 
contradicts the assumption that any set of N rows of A is 
linearly independent (recall that at least one of the constants 
c„ is nonzero). Hence, A must have full rank. ■ 
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